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We demonstrate that a stochastic model consistent with the scaling properties of financial assets 
is able to replicate the empirical statistical properties of the S&P 500 high frequency data within 
a window of three hours in each trading day. This result extends previous findings obtained for 
EUR/USD exchange rates. We apply the forecast capabilities of the model to implement an explicit 
trading strategy. Trading signals are model-based and not derived from chartist criteria. In-sample 
and out-of-sample tests indicate that the model performs better than a benchmark asymmetric 
GARCH process, and expose the existence of small arbitrage opportunities. We discuss how to 
improve performances and why the trading strategy is potentially interesting to hedge volatility risk 
for S&P index-based products, nomalous scaling; Memory; Intraday returns; Intraday strategy. 

I. INTRODUCTION 

Recent studies of high frequency (HF) data for foreign exchange (FX) rat es M , 0, regarded the many daily 
realizations of asset returns as constituting a statistical ensemble of histories [241 ]. The single HF time series from 
which such ensembles can be extracted are known to present clear periodic patterns, with period of one day [see for 
instance flTM [TlT ItH. and therein cited references]. For example, in the EUR/USD case, the volatility over successive 
10 minutes intervals monotonically increases or decreases during specific time windows within the day. This behaviour 
is observed by averaging the volatility on both a daily and a weakly basis. The idea in Baldovin et al. Q, Bassler 
et al. Q is to endogenize the periodic, nearly deterministic behavior of the EUR/USD exchange rate HF volatility 
into a time inhomogencous stochastic process for the evolution of the asset. The analysis of the ensemble of daily 
histories produced by such a process reveals also a peculiar form of scaling obeyed by the probability density functions 
(PDF) of the returns, once they are aggregated over intervals of variable span within a fixed intraday window [see 
also [l6[ . We recall that the scaling symmetry is a relation linking the marginal PDF of a stochastic process with the 
duration of sampling or observation interval. In the simplest case of time intervals starting at the beginning of the 
daily window, the scaling property observed for EUR/USD exchange rate implies that the PDF p(r, r) for a return r 
over an interval of width r, once multiplied by a factor t d , yields a specific function of r/r D : 

P(r,r) = -L ff (_L) , (1) 

where g is the scaling function, and D is a suitable scaling exponent. This form leads to p(r/X D , r/A) = \~ D p(r, r), 
for any arbitrary rescaling A of the interval duration r, and means that the PDF of the returns has a known structure, 
independently from the duration of sampling intervals. A process with such a property is thus called self-similar. 
As a consequence, if we observe an ensemble of histories generated by a self-similar stochastic process and data are 
available for instance at a 30 minutes frequency, the scaling relation makes it possible to construct the densities 
of the stochastic process at both finer and time-aggregated durations. Moreover, an HF self-similar process allows 
(unconditional) density forecasts for any future horizon (say from few minutes to several hours). Such a possibility 
is relevant for structural analysis of HF data, for financial forecasting, and, as we argue in the following, for the 
identification of efficient trading rules. 

The HF scaling symmetry ([1} addressed in Baldovin et al. Q, Bassler et al. Q has further interesting aspects: g 
is non-Gaussian and D is definitely lower than 1/2. On the basis of the stability of the Gaussian density under time 
aggregation, one would expect D = 1/2 and a Gaussian g for independent returns. On the other hand, a scaling 
with D ~ 1/2, but a non-Gaussian g, is generally found in the empirical analysis of the PDF of aggregated returns 
for intervals sliding along a single, long historical time series. The deviation from Gaussianity found for the PDF of 
returns sampled in this way is an anomalous scaling feature generally ascribed to long range dependence (as revealed, 
e.g., by the presence of volatility clustering and heteroskedasticity with persistent behaviours). However, for efficient 
markets one still has D ~ 1/2 p,|Tl[. In the HF EUR/USD case mentioned above, besides non- Gaussianity the strong 
deviation of D from 1/2 emphasizes the anomalous character of the scaling and implies the time-inhomogeneity (non- 
stationarity) of the returns' process. In particular, in Baldovin et al. Q the time inhomogencous scaling has been 
attributed to a non-Markovian, strong dependence of the intraday returns, as, e.g., manifested by the analysis of 
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the ensemble-averaged volatility autocorrelation function. This strong dependence amounts to an effective volatility 
clustering phenomenon, partially hidden by the fact that the average volatility reduces in time when D < 1/2. By 
imposing consistency with the anomalous scaling of the aggregated return PDF, a mathematical model has been 
then introduced for generating the histories in the ensemble. Within this model, the PDF of each return retains 
memory of the previous ones. Once properly calibrated, the model replicates very well the statistical properties of 
the empirical HF ensemble in the case of EUR/USD data. In particular, it is able to reproduce the patterns observed 
on autocovariance and autocorrelation functions of returns, absolute returns, and squared returns obtained from the 
HF time series. 

In the present paper we apply the non-Markovian model introduced in Baldovin et al. Q to the description of the 
HF data of a different financial asset, namely the S& P 500 index. A first goal is to show that the model is able to 
replicate the features of an asset different from FX exchange rates. More importantly, we will show how it might be 
used to develop an efficient intra-day trading rule, exposing a small arbitrage opportunity. This empirical application 
builds on the scaling property and is made under the assumption of correct model specifications. The process proposed 
in Baldovin et al. [5j, conditionally to a minimum information input, i.e. the opening price and eventually the first 
few 10-minutes returns, can be used to provide long-range density forecasts (for several steps ahead within agiven 
daily time range). We compute density forecasts also using a simpler and more standard GARCH process [J [Hjj; 
namely, the asymmetric GJR model [see[l3j]. The density forecasts obtained from our model and those of the GARCH 
benchmark are used to define price bounds whose violation gives a trading signal. The bounds might be interpreted as 
predicted supports and resistances, or they could be read as a simulated price range with a given confidence interval. 
Our approach thus belongs to the large plethora of Technical Analysis-based trading methods whose performances 
have been studies by different authors, Neely and Weller [T3|, Lo et al. [l8j . Park and Scott [l!|[25[, among others. 
However, the indicators used to derive the trading signals are model-based and not derived from a pure chartist 
approach. In-sample and out-of-sample tests demonstrate the existence of trading opportunities, leading, anyhow, to 
relatively small average margins of profit. As a matter of fact, those profits vary over time, reaching interesting values 
in specific periods and always beating the average profits associated with the GARCH model. 

The paper proceeds as follow. In subsection 1.1 we briefly describe the data used within this study, while Section 
2 is devoted to the presentation of the model and to parameter calibration. Section 3 deals with the construction of 
density forecasts, and Section 4 describes the trading strategy and provides the empirical results. Section 5 concludes. 

A. Data extraction 

In this work we focus on the analysis of the S&P 500 index. We consider a dataset ranging from September 30th 
1985 to October 20th 2010. After excluding those days for which the records are not complete (e.g., for holidays 
or stock exchange anticipated closures), the whole dataset includes M = 6283 trading days. For each single day I 
(1 < I < M), we extract the index values, {s l {t)} 1 , every 10 minutes between 9:40 a.m. (when we set t = 0) 
and 13:00 a.m. (t = 20), New York time. Our time unit, 10 minutes, is fixed by the characteristics of the dataset at 
our disposal. From one side it ensures that linear correlation effects among consecutive returns are negligible; on the 
other side, however, it poses a bias on the effectiveness of the strategy proposed in the following, since trading actions 
can be executed only with a certain time-delay induced by the 10-minute database discreteness. The empirical returns 
of the Z-th day on the time-scale r are then defined as r l (t, t) = In s l (t) — In s (t — r), t = 1,2,..., 20, 1 ^ r ^ t, and 
are regarded as specific realizations of stochastic variables R(t, r). In this way, our ensemble amounts to M = 6283 
realizations of the stochastic process S(i), with logarithmic unit increments R(t, 1) = In S(t) — ]nS(t — 1). Below, 
r(t, 1) refers to a generic realization of the random variable R(t, 1). Wherever appropriate, to simplify our formulas 
we will switch to the simpler notation R t = R(t,l) and r t = r(t,l). Similarly, r\ = r (t, 1) will indicate the t-th 
return on a 10-minute-scale in the l-th history realization of our ensemble. Our first task is thus to identify a proper 
analytical model for the joint PDF of the returns R\, i?2, • ■ • , Rt, PRi,R 2 ,...,R t ( r ij r 2, ■ • ■ , ft)-, for any t = 1,2, ... , 20, 
which correctly reproduces the statistics of the ensemble {r l A t=i,..., 20 ■ 

l=l',...',M 

II. THE MODEL 

In Bassler et al. Q , Seemann et al. [2(| it has been suggested that HF financial time-series for the FX exchange rates 
offer the opportunity to deal with many realizations of the same stochastic process every day. The set of daily histories, 
restricted to suitable time-windows, can thus be regarded as an ensemble for testing the statistical properties of the 
underlying process. In fact these histories are not completely independent, because heteroskedasticity and volatility 
correlations, if estimated by time averages along the whole time series s(t) from which the ensemble {s l (t)] l=1 M is 
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extracted, definitely exceed the intraday range. However, for a large enough total number of days M in the time series 
one can expect inter-day correlations effects to vanish, allowing for a reliable statistics of the postulated ensemble. 

The general ideas at the basis of the mathematical model adopted here to describe such an ensemble can be traced 
back in Baldovin et al. [||, Baldovin and Stella @. Financial market returns in the HF ensemble relative to the 
EUR/USD exchange rate are linearly uncorrelated if taken over intervals of 10 minutes or more. However, they are 
also dependent, as shown, e.g., by the nonzero volatility autocorrelation function. Based on these evidences and on the 
existence of time inhomogeneous anomalous scaling properties (sec below), it has been proposed that the joint PDF's 
of these returns have the form of convex combinations of the joint PDF's of processes with independent Gaussian 
increments of different, time dependent widths. This implies that the scaling function g of the aggregated return 
PDF is also a convex combination of Gaussians of variable width [i| , as often assumed in phenomenological studies 
of anomalous scaling [see[2l|, and cited references]. 

The main feature of the time-inhomogeneous model described in Baldovin et al. Q is the construction of the joint 
PDF Pflj H a jit , J"2, • ■ • ,r t ), on the basis of the scaling symmetry valid for the PDF of the aggregated return 

t 

R(t,t)=ln3(t)-lnS(0)=J2 R i- ( 2 ) 

»=i 

The PDF of R(t,t), Pn(t.t)( r )i satisfies a scaling symmetry with scaling exponent D and scaling function g, if, for 
every t, 

t D P R{ t,t){t D r)=g{r) (3) 

[this equation, in more precise notations, is equivalent to Eq. (JTJ]. Under the premises of this scaling symmetry, 
PRi,R 2 ,...,R t ( r ij r 2, ■ ■ • , ft) is then reconstructed as 

roo * exp (— 2 ^i a -i \ 
PR lt R 2 ,...,R t (ri,r 2 ,...,rt)= dap ( a ) TT a °* , (4) 



where 



« ee ^ - (i - ir ] 1/2 , ( 5 ) 



p(a) > is a PDF for a mixture of Gaussian distributions with different widths a [see, e.g., [lO, [23], and the scaling 
function is given by 

2 

g(r) = I p{a)^=da. (6) 
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The coefficients determine the time-inhomogeneity of the variables i?, 's, which is also manifest in a peculiar scaling 
form for its PDF PR 4 (ri), namely, 

at PR,( a i r i) = g(r). (7) 

Only for D — 1/2 the variables Ri's become identically distributed. See Baldovin and Stella @, Stella and Baldovin 
[2l| for additional details on the derivation of the joint PDF. 

The next step is the identification of a proper parametrization for the scaling function g. Since a is a measure 
of volatility, a convenient way of representing this fat-tailed scaling function in finance [see, e.g., is through an 
inverse-gamma density p: 

where ero > and a are a scale and a form parameter, respectively. As a simplifying assumption, we disregard the 
weak skewness of the return distribution, whose inclusion in the modeling framework is currently under development. 
The resulting scaling function g{r) has then a power- law decay for large |r| with exponent a + 1, whereas do simply 
sets the scale of its width. Within this parametrization, g is thus identified in terms of the two parameters ctq and a. 

In this paper we extend the application of this model to the S&P 500 index, working with a larger set of realizations 
(AI = 6283) with respect to the EUR/USD dataset used in Baldovin et al. @. The larger database enables us to 
reduce the statistical uncertainties of the empirical quantities. Interestingly, we find that the same general features of 
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FIG. 1: Scaling behaviour of the non-linear moments. 



the dynamics of the EUR/USD exchange rate also characterize the S&P 500 index and that they are both very well 
reproduced by the above model. In the Appendix we give an account of these results, which include the observation of 
a systematic decrease of the empirical second moment of the elementary intraday returns (over intervals of 10 minutes) 
as a function of time (time-inhomogeneity), the vanishing of their linear correlation at and above the elementary time- 
scale, and the presence of strong non-linear correlations. In the next Section we report instead the empirical evidence 
used to calibrate the three parameters D,oq. and a. 

A. Model calibration 

The scaling exponent D and the scaling function g play here a central role as they define respectively the a;'s and 
p(a) appearing in the joint PDF of a given daily realization of the process. For the purposes of this study, we adopt a 
two step calibration procedure. At first, we determine D, and then we use D in order to obtain a data-collapse which 
allows identifying p. 

A quantitative way to calibrate D is offered by the analysis of the moments of R(t,t). A simple integration shows 
that if PR( t ,t) (r) satisfies Eq. d3]), independently of g we have E [\R(t, t)\ q ] — t qD . Vg £ R. In Fig. Q] we plot the results 
of the linear regression, qD(q), of the logarithm of the time evolution of the empirical moments 

M 

^(Ml^^lAM)!' (0<q<2). (9) 
t=l 

The resulting regression slope D(q) = D ~ 0.36, identifies the value of D and gives a proof of the anomalous scaling 
symmetry [Eq. ([3])] satisfied by the empirical S&P 500 data. Quite remarkably, this value is very close to that 
estimated for FX exchange rates in Refs. Baldovin et al. Q, Bassler et al. Q, Seemann et al. [20j | . It is important to 
point out that we limit our scaling analysis to q < 2 because of an evident multiscaling behaviour for q > 2 [see also 

Taking advantage of the knowledge of the scaling exponent D = 0.36, a x 2 minimization algorithm shows that the 
values (To = 0.0045 and a — 3.69 adequately fit (see Fig. [2]) the data-collapse plot obtained via Eq. ([3]). These three 
parameter values, calibrated upon the entire available history, are used for in-sample evaluations of the model and 
subsequent trading strategies. 
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FIG. 2: The scaling function for the S&P after aggregation. 



However, we are interested in performing out-of-sample analyses also. In such cases, starting from our 25-years 
database, we decided to use the first 15 years of data (from 1985 to 1999) to identify specific values for the parameters 
(D,<?o,a) to be used to realize the trading strategy for the year 2000. We then repeatedly shifted year by year, 
always using the 15 previous years to calibrate the model, until 2009 (the last complete year). So, e.g., we used 
the data from 1994 to 2008 to fit the scaling function for the strategy to be used for year 2009. Fig. [3] shows the 
optimal parameter values used for the implementation of the out-of-sample strategies from 2000 to 2009. The scaling 
parameter D slightly increases over time and stabilize from 2005 to 2009. In contrast, the parameters characterizing 
the inverse-gamma density show evidence of a clear drop once the crisis period of 2008 is introduced in the calibration 
sample (see, e.g., the comparison between the inverse- gamma's of year 2005 and 2009 presented in Fig. SJ. 

III. DENSITY FORECASTS AND TRADING SIGNALS 

The model proposed here and in [B[ has implicit density forecasting capabilities. Indeed, assuming correct model 
specifications, given the parameter values and the first daily return, it determines the PDF of the daily evolution of 
returns. This determination is not limited to each single return, but extends to any aggregation of returns within the 
validity of the scaling symmetry. Such a possibility allows the construction of a trading strategy based on density 
forecasts. Furthermore, at each point in time within the chosen scaling window, the forecasts for the remaining part 
might be updated using the additional intra-daily information. For this reason, we consider in the following two 
different trading approaches: the first makes use of the opening value of the traded asset at the beginning of the 
chosen time- window, s'(0), and then bases only on this value the density forecasts for the remaining part of it (we 
call it Unconditional trading); in the second approach the density forecasts are updated using the information of the 
most recent returns (the Conditional trading, since the forecast is conditioned to the fresh intraday information). 
With both trading approaches it is possible to extract trading signals from density forecasts: given a certain quantile 
< q < 1/2, if at a certain time within the intra-daily range under study the observed market price is above (below) 
the 1 — q (q) quantile of the predicted price density we have a buy (sell) signal. 

A. Unconditional trading signals 

In this case, for each day I, we calculate the quantiles for the expected price at time t, conditioned to the opening 
value s l (0) only. Note that our approach, in computing the density forecasts, does not rely on specific previous-days 
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FIG. 4: Comparison between the inverse-gamma functions calibrated before and after the 2008 crisis. 



asset information. It is assumed that all the information contained in the past observations has been compressed in 
the model parameters, namely (D,ao,a). According to Eq. the PDF for the return PR^rt), is given by: 



PRi ( r i) 



dn,..., dri-i,,dri+i, ...,dr t PR lr ..,R t (ri, ■■■,r t ) 



da p(a) : 
^2ira 2 a 



(10) 
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FIG. 5: Upper and lower expected index values for the first day of the dataset (September 30th, 1985), confronted with real 
prices (in circles). Lines are linear piece- wise interpolations. 

The lower-bound values of the expected returns, rf mm , i = 1, ... ,20 for the lower quantiles q are then easily obtained 
by numerically solving the following equation with respect to r\ ' mm 

q= dx da p{a) ——=—. (11) 

J-oo Jo y'2TTa z a^ 

Due to the parity of the scaling function g(r), the corresponding upper-bound values r f max are simply obtained via 
sign flip: r q i ,max = — r f nm . The lower and upper expected values of S(t) can then be easily calculated, because S(t) 
is a monotonic function of the Ri's, namely, 

S(t) = S(0)e^ Ri , (12) 

and so the quantiles of S(i) are directly related to the quantiles of Ri, once the daily opening value is given. In order 
to simplify our notations, we will drop the explicit dependence of the quantiles of S(t) on the trading day, and indicate 

ii q,min j q,max 

them as sf and sf 

Summarizing, for every choice of q with < q < 1/2, for every time t from 1 to 20 within each trading day, two price 
barriers are obtained, s\' mm and s ^ max . With probability 1 — 2q the price at time t is placed between these values. 
For instance, in Fig.[5]the results of the in-sample analysis for the first day of the dataset (September 30th, 1985) are 
shown. As explained below in details, the comparison between these barrier values and the actual real market price 
lead us to the definition of a buy or sell action. 

The empirical analysis considers both in-sample and out-of-sample cases. The difference between the two is that for 
the former a unique set of values (D,ao, a) is used, calibrated with the whole 25-year dataset, while in the latter case 
the parameters (D, ctq, ol) are calibrated each year, on the basis of the previous 15-year history. For both approaches 
we will use the quantiles q = 0.05, 0.1, 0.25 (5%, 10%, 25%, respectively). 





FIG. 6: Upper and lower expected index values [q=10%] for one random day of the dataset (February 4th, 1994). As shown in 
the inset, different numbers of conditioning returns are considered. 



B. Conditional trading signals 



To exploit the non-Markovian character of our model, besides the opening value of the day, s l (0), we can use the 
value of, say, the first return of the day r\ to condition the subsequent expected evolution of the index. Using the 
model prediction for the joint PDF of the returns [Eq. ([I))], we can build a conditioned scheme where, for example, 
the first return is used to condition the successive probability densities: 



Jo°° d<jp(a) ]Ji= 



t oxp 



, , \ _ PRi R t {ri,..-,r t ) Jo ^ ) iii-2 

PR 2 ,...,R t \R 1 {r2,---,r t \ri) = = / \? \ ' 1 - 20 " ( 13 ) 

PRi vi) 



More generally, we can write the following expression, conditioning to the first t' returns the PDF of the successive 
t - t' returns, with 1 < t < 20 and t' < t: 

PR t , +1 ,...,R t \Ru...,R t , (rt'+i, ■ ■ ■ ,r t \ri,. . . ,r v ) 



PR u ...,R t , (ri,...,r t >) 
J °° dap(a) J] - =s+ 



exp 

J °° dap(a) nLi 



(14) 



The construction of the price barriers sf m% and s ^ max proceeds then as described in the previous section, simply re- 
placing the joint PDF p_R 1 ,i? 2 ,...,ij t (ri,r 2 , ...,r t ) with the conditional PDF p Rt , +u ...,R t \ Ru ...,R t , (n>+i, ■ ■ ■ , n\r\, . . . , r t >). 

In Fig. |6] some interesting features of conditional trading signals can be detected. In particular, it is clear the 
influence of the nearest past returns to determine the expected amplitude of the next ones. Conditional trading will 
thus be based on bounds which are not fixed within the day but may vary according to the new information included 
in the most recent market prices. 
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IV. DEVELOPING AND APPLYING INTRADAY STRATEGIES 



When the index value s l (t) breaks through the lower sf' mm or the upper s ^ max quantile, this violation can be 
used to define a trading strategy. Note that the ensemble property we observe lasts from 9:30 a.m. to 1 p.m. As a 
consequence, we define a trading strategy which operates within this time lapse. To avoid the impact of news arrivals 
between 1 p.m. and the opening of the next day, the trading strategy opens and closes positions within the day. 
Furthermore, since we are using a 10-minute dataset, density forecasts (and therefore the quantiles at level q) will be 
available from 9:40 a.m. . 

Within a certain day I, given its specific quantiles at level q, s|' mm and s ^ max ; the trading signals and the trading 
activity are defined as follows: 

A: If there arc no open positions 



B: If there are open positions 

B.i: Close a long position if s l (t) < s ^ max 
B.ii: Close a short position if s l (t) > sj' mm 

B.iii: Close long or short positions if they are still in place at 1 p.m. 

By construction, multiple trades arc possible within volatile days. Differently, in trending days, single operations will 
take place, while during stable days, no positions will be taken. 

Our purpose is to monitor the performances of the trading strategy defined above. Therefore, we simulate the time 
evolution of a trader using our strategy and having an initial cash amount equal to 1 million. Given the previous 
remarks about the validity of the ensemble properties and their link with the trading strategy, at the beginning of 
the day and at 1 p.m. the simulated portfolio is entirely composed by cash. Furthermore, in order to avoid losses 
larger than the portfolio value when implementing short positions, we limit the investment value to 90% of the overall 
portfolio value [26|. For symmetry, we apply the same rule also for long positions. As a result, when trades are created, 
10% of the portfolio remains in cash. Once a signal is observed, the trade is executed at the price s l (t). 

As mentioned above, three different quantile levels will be considered: 5%, 10%, 25%, and we simulate both 
unconditional and conditional quantiles for all these three levels. Specifically, we calculate price barriers using the 
opening price only (no conditioning), or conditioning also on the first 3, 6 or 9 returns. Higher numbers of conditioning 
points are not used, given that, within the day, the model proposed works only from the opening up to 1 p.m at a 
10-minute frequency, thus with a total of 21 observations per day. 



We first analyse the trading strategy in-samplc, in order to evaluate its abilities in terms of yearly profits and average 
return by trade (in basis points). At this stage, we also verify the impact of the different number of conditioning 
returns. Table (P) reports the average return of the trades generated by the strategy during the range from October 
1985 to October 2010. Profits are indicated as average basis points per trade, and distinguishing also between long 
and short trades. This allows to verify if the strategy better identifies signals in a specific direction. 

Three elements clearly appear. The first one is that the profit decreases with decreasing quantile values, as if 
stronger signals provide smaller performances irrespective of the trade sign. We explain such an unexpected result 
by the fact that, due to the 10-minute discreteness of our database, trades are not valued precisely at the price 
at which the barrier violation occurs, but with a certain delay. This time-delay (which could be diminished using 
an higher-frequency dataset) reduces the potential profit coming from relevant (and unexpected) price movements. 
Those, in fact, are the price changes detected by the model-based quantiles at the 5% level. 

The second relevant comment we make refers to the relation between average profit and conditioning information. 
We observe that conditioning the price barriers to the first returns of the day increases the profit. This is somewhat 
expected as during the first part of the day the model adapts its behaviour to the most recent data, and it provides 
therefore a better fit to the ensemble property within the remaining part of the day. Again, such a result holds 
independently of the trade sign. Regarding the number of conditioning values, we note that the relation between 



A.i: Buy if s l 
A.ii: Sell if s l ( 



'(*)>«? 
(t) < sf 



q.max 



A. In-sample results 
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TABLE I: Average profit per trade in basis points (1985-2010). 



25% 10% 5% 
All trades 
2.619 1.602 0.590 
3 3.788 2.842 1.862 
6 3.449 3.027 2.313 
9 3.243 3.070 2.184 

Long trades 
1.599 1.180 -0.483 
3 3.409 2.748 1.840 
6 3.094 2.942 2.564 
9 2.529 2.652 1.886 

Short trades 
3.577 1.977 1.470 
3 4.173 2.932 1.882 
6 3.824 3.110 2.087 
9 3.996 3.469 2.455 

The first column reports the conditioning elements (0 stands for no conditioning). The first row shows the quantile level. 

conditioning points and profits is not monotonic, and has a maximum between one and six conditioning points. As 
a consequence of this result and of the previous one about the quantile/profit relation, the out-of-sample analyses 
discussed in the next section have been performed with three conditioning points, which is the value with maximum 
profit for 25% quantiles. 

Table (jll| reports the same quantities as Table (jl|, but by year (we drop 1985 and 2010 where only part of the 
year was available). In general, the use of a 25% quantile provides the higher average profit. Furthermore, we observe 
that the largest part of negative values are identified in the range 2004-2007, where the market was clearly upward 
trending and in a low volatility phase. This is an expected outcome, since the model detects violations which are 
associated with high price fluctuations. The largest profits are located during 1986 and 1987, in a high volatility 
period. Overall, the average profit per trade is relatively small, and occasionally even negative. On the other hand, 
during some specific market phases the trading strategy provides large average profits per trade, see for instance the 
90's. 

Besides the impact of quantile level and number of conditioning returns, a third clement is of interest: the number 
and type of trades created by the strategy in a given time interval. Table reports several elements for the most 
recent years. The first column just repeats the content of Table Jill), while the following ones separately consider Long 
and Short trades distinguishing between true and false signals. We call "true" a trading signal which really provides 
a positive trade profit. The existence of "false" signals is due, as previously mentioned, to the discreteness of the 
dataset. Note that since in the trading simulations we execute trades at 10-minute frequency, the ratio true-to-false 
signals might not be optimal. Indeed, to execute trades few seconds after the violation of a quantile barrier would 
improve the model performances, potentially increasing the number of true signals with respect to the false ones. In 
light of this comment, the presence of substantial positive profits despite the relatively small number of true signals 
[see the last two columns of Table (|III[) ] should be considered as a positive, potentially interesting outcome of our 
strategy. A further element supporting the strategy is the average profit of trades originated by true signals. Both 
for long and short trades, these average profits are sensibly larger, peaking around about 80 basis points. On the 
contrary, false signals lead to trades with small losses (compared to the gains), even if these losses are larger during 
volatile market phases as in 2008 and 2009 (both for long and short trades). The number of trades is much larger 
when using 25% quantiles compared to 5% quantiles; while there are small differences between the use of Conditional 
and Unconditional quantiles. Long and short trades are almost numerically equivalent both in trending and volatile 
market phases. As naturally expected, overall the number of trades increases during volatile periods, irrespectively 
of the sign. Finally, also the number of relatively few true signals does not depend on the trade sign. In summary, 
Table (|III[) show evidence of some potential interest in the proposed strategy, since the average profit for true signals 
is quite elevate (in particular compared to the overall average profit). 

While the previous tables where focusing on the average return over single trades, Table (|IV|) focuses on the overall 
profit of the strategy over single years (assuming a starting cash amount of 1 million). The returns are reported in 
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TABLE II: Average profit per trade in basis points (yearly values). 



Year Unconditional Conditional 3 points Conditional 6 points 
Quantile 25% 10% 5% 25% 10% 5% 25% 10% 5% 



1986 


8.31 


6.06 


6.02 


11.39 


9.95 


9.96 


11.84 


12.21 


12.13 


1987 


12.32 


11.87 


12.79 


17.14 


15.50 


13.60 


15.04 


11.31 


12.11 


1 Q88 


3 9fi 


1 Q8 


1.24 


7 34 


fi nn 

u.uu 


4.02 


8 83 
0.00 


8 37 
0.01 


7 43 


1989 


5.11 


2.69 


-1.33 


6.88 


5.87 


7.83 


7.26 


5.95 


4.02 


1990 


9.00 


8.20 


5.58 


11.11 


10.19 


5.64 


10.90 


7.84 


3.62 


1991 


4.46 


1.13 


1.58 


6.35 


5.65 


2.55 


7.54 


7.48 


4.12 


1 QQ9 


3 1 fi 




3 7fi 


4.45 


^ 9Q 


7 71 


3 fi9 


3 78 


3 71 




1 <\A 


U.tJU 


9 3Q 


3 38 

0.00 




9 87 


3 1 ^ 


4.04 


3 sn 


1994 


1.90 


-0.13 


-3.79 


2.76 


1.48 


-0.89 


3.77 


2.10 


1.18 


1995 


0.08 


-2.36 


-11.08 


2.31 


1.46 


-0.26 


2.14 


1.23 


0.13 


1996 


0.84 


-1.63 


-2.36 


2.16 


0.89 


-1.28 


3.19 


2.37 


0.28 


1997 


2.62 


1.33 


1.48 


4.85 


3.90 


2.90 


3.81 


3.49 


1.51 


1 QQ8 


O.O I 






A 73 


n 3^ 


n 7fi 


3 Q1 


3 Qfi 


4.24 


1999 


1.48 


0.84 


-2.02 


2.82 


2.57 


0.79 


3.18 


6.09 


4.98 


2000 


0.02 


-2.81 


-0.88 


2.78 


0.81 


1.72 


0.32 


1.14 


3.74 


2001 


0.60 


0.29 


0.29 


2.69 


1.64 


2.23 


0.03 


-2.14 


-2.33 


2002 


2.65 


0.28 


0.35 


3.49 


0.17 


-0.62 


-0.55 


-0.77 


-2.64 


2003 


1.81 


0.38 


2.29 


0.40 


-0.30 


-1.96 


0.99 


-0.51 


-0.52 


2004 


-1.26 


-1.66 


-1.74 


-0.38 


-0.37 


0.16 


0.02 


0.90 


0.47 


2005 


-1.91 


-2.06 


0.52 


-1.23 


-0.25 


0.98 


-0.58 


-1.36 


-1.22 


2006 


0.38 


-0.34 


-1.20 


-0.68 


-1.63 


-1.70 


0.21 


-0.52 


-2.29 


2007 


-1.68 


-4.88 


-9.35 


-1.55 


-3.42 


-4.99 


-0.91 


-2.49 


-3.36 


2008 


4.06 


2.37 


-1.73 


4.37 


3.43 


-0.50 


3.03 


-1.08 


-2.16 


2009 


2.20 


4.23 


2.88 


3.48 


2.19 


2.63 


2.49 


3.88 


2.85 



percentages, and show evidence of positive performances in most periods. Comparing first the Conditional versus 
Unconditional quantiles, we observe that conditional modeling is clearly better: the associated returns are higher 
apart from few cases, and their standard deviation is smaller [22j. Contrasting the 25% and 5% quantiles, the use 
of narrower bands (25%) for the identification of the signals provide larger returns over the years. This potentially 
exposes the portfolio to a number of trades generated by false signals, but the profits coming from true signals balance 
them. Such a result holds irrespectively of the conditioning type. Finally, if we compare the performances of the 
trading strategy (25% quantiles) to that of the underlying equity index [see the last two columns of Table (|IV|)]. we 
note a relevant positive result: when the market is experiencing high volatility, our strategy provides positive returns 
with a volatility smaller than that of the market, and this is particularly evident when the market has yearly negative 
returns; on the contrary, when the market is in a low volatility period, our strategy has small or negative returns. In 
general, the trading strategy has always a volatility smaller than that of the market. This finding suggests that it 
could be used to hedge the market volatility, since it provides positive returns in case of high market volatility, and 
with smaller risk. This is further confirmed by the correlation between market and our strategy returns and between 
market and our strategy standard deviation [see the last row of Table PV|) ]: positive and very high correlation in the 
case of standard deviations, and low negative correlation for returns. 

B. Out-of-sample results 

The above promising in-sample performances are confirmed in the out-of-sample results. In this evaluation, we 
compare our model to a more traditional approach, based on G ARCH processes [see H, [l2[ • Shifting from the point 
of view of ensembles to that of financial time series, several elements characterizing high frequency data have to be 
considered. First of all, and most relevant, the entire daily set of high frequency returns is to be used, and not just 
the range 9:30 a.m. to 1 p.m. Second, the periodic behaviour of the intra-daily volatility has to be taken into account 
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TABLE III: Average profit per trade and number of trades: long/short trades, false/true signals. 
Average profit (bp) Number of trades 

All Long Short All Long Short % True 







All 


True 


False 


All 


True 


False 


All 


True False 


All 


True False 


Long 


Short 














25% 


- Conditional 3 points 












2005 


-1.23 


-0.10 


21.43 


-6.76 


-2.24 


15.63 


-7.15 422 


199 


47 


152 


223 


48 


175 


23.6% 


21.5% 


2006 


-0.68 


-0.58 


15.16 


-6.28 


-0.76 


15.29 


-5.65 438 


207 


55 


152 


231 


54 


177 


26.6% 


23.4% 


2007 


-1.55 


-4.01 


13.05 


-9.36 


1.73 


29.41 


-9.19 462 


264 


63 


201 


198 


56 


142 


23.9% 


28.3% 


2008 


4.37 


-0.67 51.60 


-20.27 


9.90 


66.53 ■ 


■18.77 505 


264 


72 


192 


241 


81 


160 


27.3% 


33.6% 


2009 


3.48 


2.57 


36.17 


-12.87 


4.42 


53.20 ■ 


■16.31 463 


235 


74 


161 


228 


68 


160 


31.5% 


29.8% 



5% - Conditional 3 points 



2005 


0.98 


2.36 


25.96 


-3.26 


-0.63 


9.69 


-1.92 


97 


52 


10 


42 


45 


5 


40 


19.2% 


11.1% 


2006 


-1.70 


-0.91 


15.26 


-4.69 


-2.25 


27.18 


-6.01 


90 


37 


7 


30 


53 


6 


47 


18.9% 


11.3% 


2007 


-4.99 


-6.31 


17.56 


-6.71 


-3.86 


11.24 


-8.34 


130 


60 


1 


59 


70 


16 


54 


1.7% 


22.9% 


2008 


-0.50 


-3.59 


31.85 


-16.55 


2.61 


61.09 


-19.88 


253 


127 


34 


93 


126 


35 


91 


26.8% 


27.8% 


2009 


2.63 


2.68 


53.40 


-10.21 


2.59 


53.19 


-13.78 


169 


79 


16 


63 


90 


22 


68 


20.3% 


24.4% 



25% - Unconditional 



2005 


-1.91 


0.36 


21.13 


-7.61 


-3.48 


15.40 


-8.67 


407 


166 


46 


120 


241 


52 


172 


27.7% 


21.6% 


2006 


0.38 


0.47 


19.63 


-7.50 


0.30 


26.70 


-7.78 


369 


177 


52 


125 


192 


45 


131 


29.4% 


23.4% 


2007 


-1.68 


-3.21 


16.58 


-9.91 


-0.13 


35.74 


-11.42 


479 


241 


61 


180 


238 


57 


169 


25.3% 


23.9% 


2008 


4.06 


-4.02 


57.18 


-28.56 


13.19 


71.86 


-21.66 


560 


297 


85 


212 


263 


98 


158 


28.6% 


37.3% 


2009 


2.20 


0.01 


50.94 


-23.19 


4.41 


55.57 


-19.72 


521 


262 


82 


180 


259 


83 


167 


31.3% 


32.0% 



5% - Unconditional 



2005 


0.52 


-0.75 14.39 


-6.25 


2.64 


26.78 


-0.38 


24 


15 


4 


11 


9 


1 


8 


26.7% 


11.1% 


2006 


-1.20 


-1.08 22.61 


-4.82 


-1.33 


22.42 


-9.81 


41 


22 


3 


19 


19 


5 


14 


13.6% 


26.3% 


2007 


-9.35 


-9.69 38.70 


-12.62 


-9.17 


18.42 


-16.98 


103 


35 


2 


33 


68 


15 


53 


5.7% 


22.1% 


2008 


-1.73 


-8.57 58.81 


-31.19 


5.59 


83.91 


-28.44 


362 


187 


47 


140 


175 


53 


122 


25.1% 


30.3% 


2009 


2.88 


5.61 62.38 


-14.47 


0.71 


53.57 


-18.05 


302 


134 


35 


99 


168 


44 


124 


26.1% 


26.2% 



[see 0,13, among others]. To capture these elements, together with variance asymmetry, we consider as a competing 
model an asymmetric GARCH, the GJR [see [l3| with a periodic deterministic variance component. Our choice is 
motivated by the relative simplicity of the competitor, a kind of benchmark, and by the possibility of easily generating 
from this model density forecasts at a given quantile under a distributional assumption for the model innovations. 
The competing model is given as follow: 

• the empirical returns on a 10-minute time scale are represented as: r\ = m l t e\, where t identifies the 10-minutc 
period within day I with a range which is now t = 1,2, ...T (T = 39 for our dataset), m l t is a deterministic periodic 
function, and e l t is the stochastic component; this model implies that returns are generated as R l t = N (0, m\ V c ) , 
where N(fi, a) indicates a Gaussian random variable of mean fj, and variance a, and V e is the (stochastic) variance 
of the random component; 

• ra\ is a periodic deterministic variance modeled similarly to Andersen and Bollerslev but using dummy 
variables instead of harmonics; we might represent returns as 

ln[(^) 2 ]=ln[(mj) 2 ]+ln[(4) 2 ], (15) 

with 

T 

\nl(m l t ) 2 }=a 1+ J2*Ai> ( 16 ) 

3=1 

where d t •, j = 2, ...T is a dummy variable assuming value 1 when j = I and zero otherwise, while ai , 02 • • • av 
are parameters to be estimated; 
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TABLE IV: In-sample yearly return and standard deviation compared with the S&P500 Index. 
25% - C. 3 p. 5% - C. 3 p. 25% - Unc. 5% - Unc. S&P500 



Return Dev.st Return Dev.st Return Dev.st Return Dev.st Return Dev.st 



1986 


40.47 


4.88 


11.36 


2 


.98 


33.53 


5.23 


5.06 


2.99 


14.620 


14.635 


1987 


79.71 


7.78 


21.58 


6 


.31 


61.65 


12.98 


24.05 


11.28 


2.028 


32.013 


1988 


26.55 


4.41 


5.33 


2 


.47 


12.14 


4.99 


0.97 


2.68 


12.401 


17.019 


1989 


23.91 


3.81 


7.67 


2 


.55 


16.45 


4.09 


-0.88 


1.82 


27.250 


13.006 


1990 


47.37 


4.98 


9.65 


3 


.19 


40.56 


6.21 


7.07 


3.83 


-6.559 


15.886 


1991 


23.62 


4.20 


3.43 


2 


18 


17.52 


4.86 


1.33 


2.37 


26.307 


14.242 


1992 


14.84 


3.26 


5.99 


1 


.68 


9.97 


3.29 


-1.82 


1.11 


4.464 


9.644 


1993 


10.39 


2.76 


2.27 


1 


.65 


4.06 


2.96 


-0.76 


1.27 


7.055 


8.567 


1994 


9.55 


3.04 


-0.67 


1 


.14 


6.03 


3.61 


-1.98 


1.40 


-1.539 


9.805 


1995 


7.54 


2.59 


-0.22 


1 


.36 


0.28 


2.93 


-2.80 


0.99 


34.111 


7.776 


1996 


7.87 


4.15 


-1.38 


2 


.14 


2.97 


4.68 


-1.61 


1.87 


20.264 


11.734 


1997 


20.57 


6.02 


4.86 


4 


.38 


12.49 


6.30 


2.06 


3.45 


31.008 


18.059 


1998 


20.03 


5.58 


-1.40 


3 


.26 


15.79 


8.29 


-0.07 


5.95 


26.669 


20.207 


1999 


11.98 


5.77 


1.53 


3 


.13 


6.65 


7.40 


-3.95 


3.90 


19.526 


17.999 


2000 


11.47 


8.30 


2.48 


5 


.05 


-2.08 


10.07 


-2.99 


6.39 


-10.139 


22.134 


2001 


11.63 


7.07 


4.26 


4 


41 


1.80 


9.57 


0.80 


6.18 


-13.043 


21.471 


2002 


15.22 


8.26 


-1.74 


5 


.40 


11.67 


10.65 


0.43 


7.68 


-23.366 


25.926 


2003 


1.90 


5.38 


-2.91 


2 


.80 


9.45 


7.79 


3.99 


4.80 


26.380 


17.000 


2004 


-1.61 


3.56 


0.20 


1 


74 


-4.92 


3.77 


-0.75 


1.05 


8.993 


11.049 


2005 


-4.68 


3.21 


0.97 


1 


.19 


-6.81 


3.39 


0.10 


0.57 


3.001 


10.243 


2006 


-2.82 


2.80 


-1.42 


1 


.15 


1.22 


3.70 


-0.48 


0.92 


13.619 


9.985 


2007 


-7.40 


3.90 


-6.06 


1 


.49 


-7.96 


4.83 


-8.67 


2.39 


3.530 


15.928 


2008 


19.95 


10.70 


-2.23 


6 


.99 


18.47 


15.45 


-8.63 


12.55 


-38.486 


40.810 


2009 


15.43 


7.97 


3.99 


5 


,43 


9.63 


10.29 


8.29 


7.68 


24.707 


27.177 


Corr. 


-0.08 


0.95 


0.04 


0.94 


-0.08 


0.98 


0.15 


0.98 







• furthermore, the stochastic term e\ follows a GJR model (Glostcn ct al. 1993) allowing thus for the decomposition 

e\ = a\ Z\, (17) 
where Z\ = N(0, 1) and the conditional variance is given by 

(a l t f = u J +(a Q + a 1 I{e\ < 0)) {e l t ? + ft (^) 2 , (18) 

where 

- {e l t f ~ (eU) 2 if t > 1 and [e l t ? = (e^ 1 ) 2 if t = 1, 

- {alf = (a^) 2 if t > 1 and (a}) 2 = (a^ 1 ) 2 if t = 1, 

— 7(e t < 0) is equal to one when e t is negative and zero otherwise, 

— u, ao, a\ and ft are parameters to be estimated. These parameters must satisfy the constraints for variance 
positivity and covariance stationarity tu > 0, ao > 0, a\ > 0, ft > and ao + 0.5ai + ft < 1 (under an 
assumption of symmetry for the density characterizing Z t . 

The estimation of the model proceeds by steps. At first the periodic component is estimated by linear regression using 
equations (| 15[) and (fTBj) . The fitted periodic component is used to recover the estimated values of e\. Over those, 
the GJR parameters are estimated by Quasi Maximum Likelihood approaches using a Gaussian likelihood. Given the 
estimated parameters, and under a Gaussian density for the innovations z\, we generate possible paths for the future 
evolution of the conditional variance (u\) 2 , of the innovations e t , and of the returns r\ (the periodic component m\ 
is purely deterministic and is thus simply replicated in the forecasting exercise). Under the distributional hypothesis, 
the needed quantiles are then determined and used as an alternative input for the identification of the trading signals. 
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TABLE V: Out-of-sample average profit per trade in basis points (2000-2010). 



25% 10% 5% 


All trades 


Unc. 


0.772 -0.058 -0.267 


Cond. (3) 


1.565 0.465 -0.098 


GARCH 


-1.488 -3.207 -0.628 


Long trades 


Unc. 


-1.131 -0.854 -1.260 


Cond. (3) 


0.962 0.250 -0.513 


GARCH 


-2.812 -3.950 -0.546 


Short trades 


Unc. 


2.597 0.661 0.613 


Cond. (3) 


2.145 0.655 0.284 


GARCH 


-0.489 -2.594 -0.696 



The first column reports the model: Unc. stands for our model with no conditioning; Cond. (3) refers to the model with a 
conditioning to the first three returns of the day; finally, GARCH identifies the conditional variance specification with 
deterministic periodic component. The first rows reports the quantile level. 



TABLE VI: Out-of-sample yearly return and standard deviation compared with the S&P500 Index. 





25% - 


C. 3 p. 


5% - C. 


3 p. 


25% - 


Unc. 


5%- 


Unc. 


25% - 


GJR 


5%- 


GJR 


S&P500 




Return Dev.st Return Dev.st Return 


Dev.st 


Return Dev.st Return Dev.st Return 


Dev.st Return Dev.st 


2000 


12.96 


8.36 


-1.29 


5.31 


-2.15 


10.09 


-3.46 


6.56 


-10.88 


4.88 


-1.95 


2.46 


-10.14 


22.13 


2001 


12.61 


7.19 


5.28 


4.35 


2.15 


9.49 


0.49 


6.28 


1.75 


5.62 


0.52 


3.00 


-13.04 


21.47 


2002 


16.21 


8.49 


-0.93 


5.53 


11.03 


10.76 


0.69 


7.68 


-7.83 


5.54 


2.29 


3.39 


-23.37 


25.93 


2003 


2.93 


5.35 


-0.61 


2.95 


8.43 


7.63 


4.13 


4.69 


9.39 


4.94 


-1.27 


2.17 


26.38 


17.00 


2004 


-1.08 


3.52 


-0.69 


1.60 


-6.60 


3.66 


-0.66 


0.99 


0.70 


1.69 


-0.44 


0.44 


8.99 


11.05 


2005 


-5.68 


3.00 


0.88 


1.13 


-6.80 


3.27 


0.06 


0.52 


-1.86 


1.38 


0.14 


0.25 


3.00 


10.24 


2006 


-3.08 


2.61 


-0.99 


1.04 


0.79 


3.56 


-0.84 


0.85 


3.11 


1.90 


0.09 


0.61 


13.62 


9.99 


2007 


-9.25 


3.91 


-4.96 


1.49 


-10.39 


4.79 


-8.71 


2.22 


-4.27 


1.61 


-0.98 


0.45 


3.53 


15.93 


2008 


22.73 


10.66 


-1.90 


7.70 


18.30 


15.46 


-8.43 


12.70 


-7.02 


4.22 


-1.49 


1.38 


-38.49 


40.81 


2009 


18.36 


8.17 


1.74 


5.41 


12.31 


10.15 


8.05 


7.50 


3.99 


4.91 


-0.78 


1.33 


24.71 


27.18 


Corr. 


-0.52 


0.95 


0.06 


0.96 


-0.30 


0.97 


0.62 


0.98 


0.78 


0.64 


-0.19 


0.45 







In Table (fV|) we report the out-of-sample average profit per trade, using the Unconditional and Conditional trading 
strategies as well as the one based on the GJR model. As mentioned earlier, the out-of-sample evaluation focuses only 
in the range 2000 to 2010, since the period 1985-1999 is used to calibrate the models. Results for our model are similar 
to in-sample outcomes, with conditional modeling providing better results. Both Conditional and Unconditional model 
specifications have performances largely better than the GJR model. The differences among the strategies appear 
more clearly in Table (|VI|) . which contains annual returns of the simulated portfolios. We note here that using the 
25% quantiles together with a conditioning on the first three returns of the day provides the best results in high 
volatility market phases (large than 20% annualized daily market volatility). On the contrary, when the volatility is 
lower, there is not a clear preference across the models (GJR included). The yearly volatility of the GJR is lower 
compared to our models as well as to the market, but the yearly profits are clearly unsatisfactory. In addition, the last 
row of Tabic (|VI[) reports the correlations between market and strategies returns and between market and strategies 
standard deviations: results confirm the previous in-sample findings for our strategies, while for the GJR we have a 
lower correlation in the standard deviations. 

Overall, the results suggest the existence of some trading opportunities with respect to the time series and sample 
size used. Furthermore, they show how the combination of advanced statistical methodologies could be used for 
the development of trading rules or trading schemes which have a large resemblance with those commonly used in 
technical analysis. 



15 



V. CONCLUDING REMARKS 

In this study we realized the first practical application of a non-Markovian model for asset evolution 0, Q , deduced 
by the scaling symmetry revealed by empirical analyses. The calibration of the parameters (D, a and a in the present 
case) has been facilitated by the opportunity of considering the intraday evolution of a given asset within specific 
time- windows, as an ensemble of different histories repeating each day 0, H(| . While previous studies demonstrated 
this property for FX exchange rates, in the present context we showed that it is possible to extend this interpretation 
and to verify that the model proposed in Baldovin et al. correctly reproduces the empirical evidence also for the 
S&P 500 HF data. 

In particular, performing both in-sample and out-of-sample analyses we demonstrated how the scaling properties of 
the stochastic process can be used to derive long-term (intraday) density forecasts. In turns, these density forecasts 
can be used to define trading signals and implement an intra-day trading strategy which exposes a small arbitrage 
opportunity. By comparing the trading outcomes to those obtained from a standard GARCH model, namely, the GJR 
[13J , we showed better performances for the trading strategy based on the proposed model. The average trade profit 
is limited over the entire time span, even though local levels might be higher. Further studies aiming at improving 
the trading strategy and the empirical application of the model are thus required and under development. 

Summarizing our findings, we can say that the proposed model has some potential for the development of trading 
strategies aimed at hedging the volatility risk, since their performances are positive during high market volatility, 
and characterized by a lower risk compared to the market index. Signals extracted from the model could also be 
considered as confirmatory signals for other strategies working with high frequency data, or could be used to detect 
relevant market movements. 

In this empirical example we do not consider several elements that could have an impact on the trading strategy 
profits. We motivate this by the need of evaluating the model in comparison to a simple benchmark. Across the 
elements we did not include, we have the trading costs. Once those are introduced, the profits reported in the 
previous tables would be sensibly reduced. However, the trading strategy we implement is based on a fixed frequency 
database, using a 10-minute interval. This has a relevant impact on the trading outcomes. In fact, if a quantile 
violation is observed at time t, we execute the trade with the price observed at this same point in time. However, 
the violation could have taken place in any instant in the ten minutes before t. A trader using our approach would 
produce quantiles to be used for each period of 10 minutes, but would immediately detect the violation, and operate 
in the market soon after it (assuming she/he fully trusts the signal). On the contrary, working with a fixed time 
span of 10 minutes, we lose part of the potentially relevant content of the signal, since the price at time t might be 
significantly different from the real price observed at the trade execution just after the violation occurred. 

Another element not included in our trading example is the remuneration of the bank account. In addition, overnight 
liquidity operations could be introduced given that the portfolio is entirely into cash from 1 p.m. of day t up to 9:39 
a.m. of day t + 1. Finally, we note that even the trading strategy could be improved, for instance introducing stop- loss 
and take-profit bounds on the implemented orders. 
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Where a statistical ensemble is a collection of elements, such as a collection of realizations of a stochastic process, with 
given statistical properties. In the cited contributions, each daily HF dataset constitutes a single element of the ensemble. 
Furthermore, each element of the ensemble is assumed to be a realization of the same underlying stochastic process. 
The properties of the process are estimated on the basis of ensemble statistics, i.e. by averaging over all available daily 
realizations. 

This paper provides an extensive survey of empirical studies dealing with technical trading rules. 

We do not take into account the margins generally required when creating short position. We motivate this choice by the 
need of evaluating the strategy abilities on both long and short trades without penalizing short positions, as would be 
the case when margins larger than 10% would be required. Furthermore, given that the trades will last at maximum for 4 
hours and 20 minutes (from 9:40 to 1 p.m.), we believe that an implicit margin of 10% will be sufficient. 
[27] The standard deviation is computed over the daily returns of the simulated portfolio and then annualized. Note that days 
without any trading signal provide zero returns, since we did not assume any remuneration for the bank account. 



Appendix - Testing model predictions 

First of all we show that the linear correlations 

CKn(t) = -======= (19) 

v /m 2 (i, l)m 2 (l, 1) 

already decay to an approximate zero value after 10 min (see Fig. [?])■ Indeed, we could have chosen a 5 min scale for 
the last few years, but in the years before 2000 this quick decay to zero of the correlation is not verified. To avoid 
dealing with linearly correlated sets, we assumed a conservative attitude and decided to work at the 10-minute scale. 

We then observe that the empirical second moment TO2(t, 1) = St=i i r t\ 2 /^ systematically decreases as a function 
of t in the interval considered (see Fig. [8|). This is a clear indication of return non-stationarity of the underlying 
process at this time scale. The model reproduces such a decay, and the empirical shape is pretty well fitted. The 




value D = 0.35, which best fits the shape of the curve, is in good accordance with the value D = 0.36 derived in the 
main text via analysis of non-linear moments. 

A widely studied indicator of non-Markovianity and of presence of non-linear correlations is the volatility autocor- 
relation, defined as 

theo M = E[H|r t |]-E[|n|]E[|r f |] 

vo1 [> ~ E[|n| 2 ]-E[|ri|] 2 ' 1 ' 



whose empirical counterpart is 



Fig. [9] pictures again the substantial agreement between the two (since we average over the whole ensemble, the 
parameters derived from the in-sample analysis have been used). 
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FIG. 8: Empirical second moment. 
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FIG. 9: Volatility autocorrelation. 



